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Introduction 

Panel  data  generally  provide  brand  purchase  information 
for  a  large  number  of  individuals  over  several  months,  and  they 
sometimes  also  include  individuals'  ratings  of  the  brand  on  some 
attitude  scules,  or  the  exposure  to  the  different  brands1  advertising 
Such  disaggregated  data  makes  several  types  of  analysis  possible 
--  one  can  identify  ever-time  patterns,  or  similarities  between 
individuals,  or  regularities  in  brand  purchase,  for  example  -- 
and  there  have  bean  several  recent  developments  of  statistical 
agorithms  designed  to  utilize  the  information  available  as 
efficiently  as  possible.  This  paper  surveys  these  new  techniques, 
focusing  especially  upon  the  considerations  that  help  the  data 
analyst  to  choose  one  technique*  ever  another. 

Preliminaries 

In  what  followr  we  assume  that  the  data  available  are  of 
the  following  nature.  A  panel  of  300  families  have  been  keeping 
traci:  of  c^oir  weekly  purchases  for  a  two  year  period,  indicating 
in  their  diaries  how  much  was  bought  and  at  what  price  of  the 
various  products  and  brands. 

Wo  assume  that  the  objective  of  the  dsta  analysis  is  to 
assess  ths  role  cf  price  in  individual's  purchase  decisions  for 
one  of  the  product  groups  represented  in  the  panel  diaries, 
coffee,  say.  As  we  are  interested  mainly  in  "our"  brand,  the 
modeling  is  focused  upon  determining  price  effects  for  that 
brand  of  coffee. 
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Next,  the  dependent  and  independent  variables  measurements 
will  have  to  be  settled  on.,  Cutting  a  long  discussion  short,  it 
is  assumed  here  that  a  three  week  period  is  found  sufficient  for 
coffee  re-purchasing  and  that  the  price  and  deal  effects  are 
played  out  within  that  time  span.  Thus,  it  becomes  acceptable 
to  define  a  dependent  variable  for  each  individual  per  every  three 
weeks  which  is  scored  a  "1"  when  our  brand  is  purchased,  a  "0"  other- 
wise .   The.  quantity  bought  in   disregarded.   Similarly,  the  price 
variable  is  measured  as  our  brand's  price  to  the  individual 
relative  to  the  mean  market  price  for  all  purchasers  during 
that  three  week  period.  Deals  are  translated  into  dollar 
"equivalents"  and  subtracted  from  the  price  before  relative 
prices  are  computed. 

An  Aggregate  Model 

With  these  definitions  and  measurements  of  purchase  and 

price,  a  simple  regression  model  is  developed.  It  relates  the 

relative  frequencies  of  purchases  of  our  brand  (cur  "market 

share")  linearly  to  our  relative  price  and  a  constant: 

CD      P0t  -  c0-   0<r0/r.)t+  u0t  ,  t-  1.....T  , 

vhe  e  pfl  denotes  our  market  share  at  time  t,  c*  is  a  constant 

unique  to  oiu"  brand,  efl  is  a  slope  parameter  alco  unique  to  our 

brand j  rQ  is  our  price  and  r  is  mean  price  for  our  competitors, 

uQ  is  a  disturbance  term,  made  up  partly  of  left  out  variables. 

The  two  unknown  parameters  crt  and  a„  are  to  be  inferred  on  the 

0      0 


Thus,  a  simple  "market  share"  measure  for  our  brand  is  any  three- 
week  time  period  is  the  number  of  l'a  relative  frequency  to  the 
total  number  of  -purchasers. 
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basis  of  tha  34  time  periods  of  observations  available.  Since 
p_   is  a  relative  frequency  measure »  u   cannot  be  assumed  to 
have  a  normal  distribution  --  a<  lordingly,  the  modal  is  rewritten 
as 

<2)       fot/(1*potJ    =  co  +  Vro/r->t   +  v0t  ' 

which  can  be  estimated  using  ordinary  least  squares  (see  Johansson, 
1973).  This  model  is  no  longer  linear,  but  exhibits  a  diminishing 
return  shape  for  positive  relative  prices.   In  management's 
judgment  tiiis  representation  is  batter  than  the  linear  formulation, 
since  some  customers  will  always  buy  regardless  of  price. 

A_  D  isjagg  ?£BP.  t  g_  jjo  del 

We  might  feel  a  little  uncomfortable  about:  some  of  the 
assumptions  involved  in  the  Initial  model,  however.   In  particular, 
the  assumption  that  individuals  are  homogenous  with  respect  to 
their  price  sensitivity  might  be  unwarranted.  In  such  a  case,  one 
would  hpve  to  carr,  out  the  analysis  accounting  for  such 
differences  to  exist. 

Let.  sfine  an  individualized  measure  of 

the  dependt  Lch  gives  the  amount  of  out- 

brand  bought;  during  each  eight  week  period  relative  to  the.  total 
amount  of  coffee  bought  by  the  individual  during  these  weeks.  This 
measure  would  be  the  individual  counterpart,  of  the  relative 
frequency  measure  used  above,  and.  the  problem  with  it  would  be 
that  even  with  eight  weeks  it  would  tend  to  cluster  around 
aero  and  one.   But  let  us  disregard  that  problem  here  and  gc 
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on  to  the  more  general  estimation  problem. 

Redefining  the  relative  price  of  our  brand  to  correspond  to 
the  eight  week  period,  we  could  ;un  each  individual  separately 
according  to 

<3>     Pou/(1-<W  =  C0i  +  A0i(r01/r-t  +  vOit     •  *  =  x 13 

1=1 n '   , 

with  the  subscript  i  denoting  the  individual.   The  sample 
comprises  n'<  300  individuals,  and  the  time  period  is  equal  to 
eight  weeks.   It  should  be  noted  that  P-it.  has  to  be  strictly 
less  than  1.0,  otherwise  the  left  hand  side  becomes  undefined. 
Thus,  if  we  have  completely  brand  loyal  individuals  in  the  sample, 
model  (3)  is  inappropriate  --  a  version  of  model  (1)  might  have 
to  be  used  however  unsound  it  is  statistically. 

Some  Analysis  of  Covariance  Models 

The  disaggregated  approach  will  give  us  13  observations 
per  regression  equation,  a  fairly  small  number  considering  the 
originally  available  104  weeks.  Accordingly,  it  might  be  useful 
to  consider  some  other  ways  in  which  the  individual  differences 
could  be  accounted  for.  A  cotjmon  assumption  is  to  assume  that 
the  difference  is  mainly  in  tarms  of  the  intercept,  the  slope 
coefficient  (and  thus  the  marginal  effect  of  price)  being  the 
same  for  each  individual.  This  leads  to  the  model 

(4)    P0itm-P0it}  -  CQi  +  A0(r0i/r.)t  +  vQit  ,  t  -  1.....13 

__________  i  *  l,..,nf 

We  assume  that  every  panel  member  included  has  bought  our  brand 
at  least  once  during  the  8  week  period. 
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When  running   this  model   to  get   the  estimates   of   the  C^.   and   the 

Aft   (a    total  of  nf+l  unknown  parameters)   the  usual  approach  is    to 

1 

introduce  separate  dummy  variables  for  each  of  the  individuals* 

This  is  a  considerable  saving  of  observations  compared  to  the  2n* 
parameters  to  be  estimated  in  model  (3).  Additionally.,  if  we 
are  not  interested  in  the  intercepts  but  only  in  the  elope  An> 
using  deviations  of  the  relative  price  from  their  respective 
individual  mean  will  give  m-    the  identical  estimate  without  the 
nf  constants,,  thus  preserving  degrees  of  freedom  even  more. 

It  is  easily  seen  from  (3)  that  the  choice  of  recognizing 
individual  differences  can  easily  be  changed  into  allowing 
differences  over  time.   If  it  is  assumed  that  the  intercepts 
are  fairly  constant  across  some  (homogeneous)  individuals,  but 
may  be  changing  as  time  goes  by  (e.g.  because  of  the  product  life 
cycle  progression),  the  same  approach  as  the  one.  used  to  derive 
(3)  and  (4)  but  working  with  deviations  from  over-time  means 
will  allow  for  such  changes  to  surface  in  the  estimation. 

Furthermore,  it  might  be  desirable  to  allow  for  Individual 
effects  as  well  st  time  effects  through  differences  in  intercepts. 
This  would  be  the  c&se  where  the  purchases  vary  across  individuals 
but  also  in  come  level  ever  time.  That  is,  the  individuals  differ 
in   a  stable  manner  throughout  the  time  period  under  observation, 
while  all  their  purchases  are  shifted  ("on  the  mean-1'  as  it  were) 
from  time  period  to   time  period.  Then  the  correct  model  would  be 

(5)   P0tt/(1-P01t)  -  C0_  +  Vr0jL/r.)t  ':'  "oi  +  v0t  +  e0K  ' 


1 

Notice  that  this  approach  is  commonly  termed  the  analysis  or 
covariance.  The  intercepts  represent  "main"  effects,  while 
price  is  "covaried  out",  in  the  language  of  the  analysis  of 
covariance.  Here  we  are  interested  more  in  the  covarying 
va r iables '  effect. 
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with  i  and  t  indexed  as  in  (4) ,  0         repre! 

intercept,  and  with  u,  v,        i  te  error  terms  representing  the 
"individual  efj  f  feet" ,  and  ;.     Brail  error  term, 

respectively.  As  error  term  the  u  and  the  v  ore  random  variables, 
but  one  car*,  equivalently  interpret  t     is  fixed  parameters  to  be 
estimated  as  intercepts,   tfith  that  view  the  estimation  is  rather 
straightforward  generalizing  from  the  earlier  approach.  We  simply 
work  with  variables  as  deviations  from  the  individual  means,  the 
time  means,  and  the  overall  mean. 

Thus,  if  we  are  not  interested  in  the  varying  intercepts 
but  only  the  price  coefficient,  each  observation  on  the  independent 
price  variable  (and  correspondingly  for  the  dependent  variable) 
is  transformed  by 

(6)    <R01/K.)t  •  <V  >t  -  ".  .        > 

where  r.  is  the  raean  price  at  t   over  all  individuals,  r     the 

mean  price  for  individual  i  across  all  time  periods,  ?:.\d   r.   is 

r  '  it 

the  overall   mean  fo:  ha  fchi     overall 

mean  has   to  be  ing     the   two  "suV-means 

r.   and    e  ct  is  >d     twice  also. 

The  uj         .  cd  l 

*'J        *0:i  t1  "       Mt; 

For  this   extei  sion  riance   it    is  nec- 

essary that 

n'  T 

(8)   .2.  u-,  «  0  and  2,  v.  .  ■  0,  which  holds  since  Qn     is  made  to 
1 ~i   Ui         fc*j   Ot  u . 

represent  the        .  an. 
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An  Error  Components  Model 

In  the  cases  where  one  would  have  an  interest  in  the  actual 
magnitudes  of  the  individual  and  time  effects  --  perhaps  to  gain 
an  understanding  of  how  homogeneous  the  sampled  individuals  might 
be,  and  also  to  .  ~    '   great  char     In  the  brand's  market  share 
occurs  between  periods  --  it  will  be  necessary  to  explicitly 
introduce  the  dummy  variables  representing  the  intercepts  in  the 
estimation.   Compared  to  the  case  depicted  in  (7)  this  leads  to 
a  considerable  drop  in  the  degrees  of  freedom,  with  n!  +  T  new 
parameters  to  estimate.   In  addition,  one  would  think  that  if 
these  variations  between  time  period  and  individuals  are  to  be 
taken  directly  into  account ,  some  use  of  them  could  also  be  made 
in  the  estimation  of  the  slope  coefficient  for  price . 

These  two  considerations  together  with  others  --  do  we 
really  know  what  to  do  with  ail  those  dummy  variables?  --  have 
prompted  econcmetri  svelop  in  recent  years  other 

estimation  techniques  for  the  "  rror"  or  "variance  components" 
models  as  they  are  celled  (see  e.g.  Nerlove,  19/1,  ^na   Maddsla, 
1971).   Basically-  the  ^.'r.y.nt   of  the  several  alternative  methods 
developed  has  been  to  regard  the  u-,.  and  v«  i.n  (5)  zs   random 
variables  (just  as  eft.  ,  the  overall  error  terra  is  regarded), 
and  then  to  estimate  ch*  parameters  necessary  to  determine  the 
probability  distributions  of  the  two  error  terns.  These  dis- 
tributions  are  generally  aLSuraed  to  be  normal,  and  then  a 
specification  of  the  mean  and  the  variance  will  suffice  to 
answer  the  question  as  to  how  large  the  variations  between 
individuals  end  over  time  really  are. 
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There  are  several  competing  estimation  methods  for  the 
error  components  model,  each  one  having  some  advantage  depending 
upon  the.  exact  specif icaticns  01   the  statistical  model,  The  case 
with  which  we  are  dealing  here  is  one  of  the  simpler  ones.  The 
regressors  are  assumed  non-stochastic >  thereby  excluding  the  case 
where  we  introduce  a  lagged  dependent  variable  on  the  right  hand  side 
for  example,  and  we  further  assume  that 

1.  E(u01)  «E(v0t)  -K.0lt>  -0 

2.  V(uQ1)  -  *l 

3.  V(v0t)  «  a2v 

4.  V(en..)  -  o2l     , 

Qit     e 

where  1  if.  the  identity  matrix,  of  order  n'T.  These  random 
variables  are  furthermore  assumed  independent  and  identically 
distributed,  and  it  is  required  that  n!>K  as  well  as  T>K,  where 
K  stands  for  the  number  of  independent  variables  in  the  model. 

Most  of  these  assumptions  are  direct  analogues  to  the  usual 
multiple  regression  case.  Under  these  assumptions,  the  approach 
suggested  by  Swamy  (1971,  Gh.  3)  provides  the  meet  straightforward 
solution  to  the  estimation  problem.  To  get  at  the  individual 
effect,  we  simply  use  ordinary  least  squares  to  estimate 
(8)    (P0it../(1-P0it»t  -  Cot  +  AQ  (rQ  /T.)t}1   +  u0.  _ 

where  the  added  i  subscript  indicates  that  the  variables  used 
are  the  individuals  over  time  means.  *  Thus,  uhis  regression 
can  be  estimated  using  a1  observations.  As  indicated  in  (8), 


Since  we  are  not  interested  here  in  the  constsiits  Oq^.  ,  1*1,.... cf;  we 
can  furthermore  use  the  deviations  from  the  meau  across  individuals  f< 
the  estimafcioa  of  ( 
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the  error  term  does  represent  the  Individual  effect,  end  an 
estimate  of  its  variance  o~  (its  mean  is  aero  by  virtue  of  the 
ordinary  least  square  fit)  can  e  derived  in  the  usual  way 
from  the  squared  residuals  divided  by  n8~K?  summed  over  the 
individuals. 

In  the  same  manner  we  can  get  &a  estimate   of  the  variance  of 
v„.  the  time  effect,  by  running  a  regression  using  variables  mea- 

(Jt 

sured  (again  as  deviations  from  the  mean)  over  individuals  for  each 
time  period.  The  number  of  observations  acre  will  be.  T,  and 
the  residuals  can  be  used  to  generate  an  unbiased  estimate  of  the 
desired  parameter.   Finally,  using  overall  deviations  as  indicated 

in  (6)  and  (?) ,  ordinary  least  squares  will  give  an  estimate  of 

2 
o* 
u. 

But  as  indicated,  one  reason  for  the  use  of  error  components 
rcodeis  was  the  possibility  of  a  more  efficient  estimate  of  the 
•slope  parameter (s)  by  fa    r   use  of  the  inter- individual,  over- 
time variations.,   In  this  Instance  this  efficiency  comes  about 
by  a  pooling  of  tl    iveral  estimates  of  the.  slope  coefficient,  A  , 
that  we  have  generated.  As  should  be  clear  from  the  preceding 
description,  the  approach  will  yield  one  estimate  of  A* 
each  time  the  variance  estimation  is  carried  out, 
Swainy  (1971,  p.  70)  suggests  a  simple  pooling  of  the  estimates 
with  weights  inversely  proportional  to  the  variances  derived,  and 
shows  that  this  is  an  Aitken  estimator,  with  the  desirable 
properties  of  unbiasedness  and  small  (although  not  necessarily  m 
mum)  variance. 
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A  "Seemingly  Unrelated"  Test  Model 

So  far  the  discussion  has  mainly  centered  on  time  and 
individual  effects  that  basically  reflect  themselves  in  different 
intercepts.   It  is  clear,  however,  that  in  many  cases  one  will 
want  to  allow  the  price  coefficient  to  vary  between  individuals 
and/or  over  time  as  well.  Reasons  for  the  individual  differences 
lie  with  the  possibility  that  price  sensitivities  might  vary 
between  people,  or  between  segments.   Similarly,  time  differences 
might  occur  because  of  the  product's  progress  through  the  life 
cycle,  making  price  considerations  more  or  less  salient. 

A  general  extension  of  the  analysis  of  covariance  use  of 
dummy  variables  allows  the  dummy  variables  to  be  introduced  as 
interactive  with  the  slope  coefficient  as  well.  The  procedure 
amounts  to  assigning  a   new  variable  the  value  one  for  "different" 
individuals,  and  the  value  zero  for  the  others.  Then  a  new 
regressor  is  area  Led  by  multiplying  this  new  dummy  variable  by 
the  price  variable,  and  introdu  ing  both  this  new  regressor  and 
the  original  price  variable  in  the  regression  model.   If  more 
than  one.  different  individual  or  group  of  individual  is  wanted, 
another  dummy  is  created  along  the  .-same  lines,  and  a  third 
regressor  created.  A  good  presentation  of  the  overall  approach 
is  given  by  Gujarat!  (1970),  end  for  an  interesting  marketing 
application  of  the  modal  the. "reader  is  referred  to  Winter  (1973). 
In  the  limit,  where  each  individual  is  seen  as  different  from 
everyone  else,  the  model  simply  computes  separate  regressions 
for  each. 
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As  before,  the  correspondence  to  over-time  differences 
follows  directly,  with  the  dummy  variables  now  representing 
different  time  periods.  Also  as  before,  the  use  of  the  dummy 
variables  grows  increasingly  undesirable,  as  the  differences 
increase,  since  the  degrees  of  freedom  quickly  go  down.   There- 
fore, a  development  of  a  technique  assuming  random  coefficients 
has  recently  been  undertaken  by  some  econometricians,  a  parallel 
development  to  the  error  components  solution  to  the  analysis  of 
covariance  case. 

Since  random  components  model  are  relatively  hard  to  estimate , 
it  is  usually  desirable  to  first  test  whether  or  not  differing 
slope  parameters  are  in  fact  desirable.   If  a   priori,  theoretical 
considerations  dictate  their  use,  no  empirical  test  is  strictly 
necessary.  However,  the  appropriate  empirical  test  to  use  is 
the  ''seemingly  unrelated"  model  developed  by  Ze liner  (1962). 
Here  separate  regressions  for  each  individual  unit  are  first  run 
to  generate  an  estimate,  of  the  variance-covariance  matrix  between 
different  individuals,  which  is  used  to  establish  a  more  efficient 
estimate  of  the  different  slope  parameters.  Then  an  approximate 
F-auatistic  is  generated  which  leviations  of  the 

individual  parameter  estimates  from  the  overall  mean  parameter 
vector  (with  our  single  pric       bles  the  vector  becomes  a 
scalar),  relative  to  the  residual  variations  "within"  the  individual 
over  tine,1  A  good  example  &£   a  marketing  application  of  this 
model  is  presented  by  Beckwiih  (1972), 

Most-  standard  econometric  and  statistical  computer  packages  now 
offer  the  "seemingly  unrelated"  model.   It  is  particularly 
appropriate,  where  separate  regressions  are  run  for  different 

individuals,  but  where  there  is  reason  to  believe  that  the  error 
term  in  each  equation  is  related  across  equations.  This  would  be 
the  case  where  the  factors,  other  than  relative  price,  that  affect, 
market  shares  would  be  the  same  across  individuals  (e.g.  the  effect 
of  the  introduction  of  fre.eze   dried  coffee)  . 
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Whether  or  rot  the.  test  indicates  that  the  parameters  in 
fact  do  differ »  one  would  generally  also  need  to  test  whether  or 
not  the>  are  random.  Such  a  tet  :  is  described  in  "wamy  (1971, 
pp.  122-124).   it  is  a  likelihood  ratio  test  which  requires  an 
iterative  estimation  procedure  which  so  far  is  not  generally 
available .  Therefore,  the  tendency  among  eccnometricians 
currently  seems  to  be  one  at   a  prlo        'ication  of  ehe  random- 
ness, and  then   estimation  of  the  parameters  directly. 

A  RftgAprc..  C. p.e  f  f  ic  ien t  Hod ej. 

if  the  test(s)  carried  out  indicates  that  differences  among 
individual  parameters  are  indeed  significant,  the  u.:e  of  a 
random  parameter  model  will  be  warranted.  We  will  here  only 
describe  the  simplest  such  specification,  but  the.  reader  should 
be  ax-sre   thai,  (much  az   in  the  case  of  error  composer      els) 
several  competing  alternatives  do  axist.  the  sppropr lateness  of 
which  depends  uvea  the.  particular  stochastic  nature  of  the  model, 

Th      ■  |  rh  tng  r.      :  ^efficient  models 

is  the  assumptioi      the  .  '   (2)  are  random 

vai ) 

(9)    A0i-£0+  i        .     X , 


with  the  randan  increment  dn,f  added  to  :1     rail  parameter 
mean  A-  .  Under  the  assumptions  that  the  dn. ,  1*1,. ..,n'  are 

identically  and  independently  distributed,  with  S(dn.) 

7 
and  V(d  )  *  fi",  and  the  usual  assumptions  of  the  multiple 

regression  model,  the  estimation  is  relatively  straightforward, 
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First  we  get  separate  estimates  (via  separate  runs)  of  the  indivi- 
dual  parameters,  A0l>  i»l»...ynf.  An  unbiased  estimate  of  A  is  deriv 

by  taking  a  weighted  estimate  o..  these  n  estimates,  the  weights 
inversely  proportional  to  their  variances.  Thus,  in  order  to  make  the 
procedure  operational  we  have  to  assess  these  variances. 

To  get  an  estimate  of  the  variance  we  first  need  to  account 
for  the  equation  disturbance  terms.  An  estimate  of  the  individual 
equation  variance  is  as  usual  given  by  the  residuals: 

(  Y  .  -  Y  ,)2 

(10)    ai±   «  -JLL—JZL-.         ,w V, 

where  s.,  is  the  variance  estimate  of  the  equation  for  the  i'th 

li  ^ 

individual,  1=1,..., n,  Y  .  is  the  actual  sample  value  of  the 

ai 

dependent  variable  (here.  pn.  /(I-pA. .)) ,  and  i    .    is  the  predicted 

Ul t     Oi t         pi 

value. 

Next  we  treat  the  individual  parameter  estimates  A\. ,  as  a 

random  sample  of  size  n  and  compute  its  variance  S. .   If  we 

A 

assume  the  parameter  variances  to  be  the  same  for  all  individuals, 

an  .estimate  of  the.  population  ~ -irameter  variance  would  be 


(11)  V(dot)  *  V(AQ)  -  SA/(n,-l). 

Accounting  for  the  equation  disturbances  allows  us  to  decrease  this 
variance,  however.  As  Swamy  (1971,  p.  107)  shows,  the  unbiased 
estimate  of  the  variance  is 

(12)  ¥(A  )  «  62  «  S./(n'-l)  -  ~  E   s   (X/X.)"1  , 

where  X.  refers  to  the  data  on  the  independent  variables  (here 
only  relative  price)  for  the.  i'th  individual. 
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/\ 
The  variances  of  the  estimates  An,  ,  1=1,..., n1,  are  then  the 

sum  of  the  variation,  in  the  parameters  themselves  and  the  sampling 

variation: 

(13)   V(A  >  «  62  +  ■  (X.'X.)"1    ,  i-l,../,n,l 

\J  i.  X 1     Jl     I 


where  the  sampling  variation  is  computed  similarly  to  the  usual 
regression  case . 

Finally,  we  derive  the  weights  of  the  estimators  A- , ,  i=l,...,n  , 
from  the  estimator  variance:. 

(14)  W(A01)«[J21  [e2+  •  (X^X  r1]"1]  [?+  s^O^'X.)"1]"1. 

Accordingly,  the  estimate  of  JL  becomes 

(15)  AQ  *  ^  A0i/tf(A0i)   ,  i«l,...,n', 

which  is  a  minimum  variance,  unbiased  estimator.   Its  variance  is 

given  by 

d6)  vdQ)  -  :14^2  +  •11»t,x1)"l3, 

that  is,  a  simple  sum  of  the  component  variances  (since  the 
individual  estimates  A,,.  i«l, . . .  ,n',  are  considered  independent, 
their  covariances  arc  zero). 

As  before,  the  approach  applies  directly  to  the  estimation 
of  parameters  where  l         fluctuations  occur  over  time  rather 
than  across  individual! 

Variable  Parameter  Regress  u>^  Models 

It  should  come  as  no  surprise  to  the  reader  that  the  next 
developments  in  panel  data  analysis  involves  slope  parameters  which 
vary  over  time  as  well  as  across  individuals.   One  development, 

by  Hsiao  (1973),  expresses  the  parameters  as   sums  of  time  effects 
and  individual  effects,  much  in  the  manner  of  the  covariance 


„"»  '■  ... 

models  developed  earlier.   In  another  development  by     berg  (1973), 
a  much  more  general  framework  is  presented.  Here  the  individual 
parameter  vectors  car  he  either  identical  across  Lie  population 
(or  a  subset  of  it)  or  differing  between  indiv;    s,  and  over  time 
the  parameters  are  seen  as  either  fixed  or  following  a  stochastic 
process.  The  differences  across  individuals  at  any  point  in  time 
are  treated  analogous  to     Swamy  approach  Just  presented.  The 
stochastic  process  that  represents  the  over  time  changes  allows 
for  many  different  specifications  provided  only  the  processes  are 
stationary.  The  number  of  time  periods  in  the  sample  would 
generally  set  an  upper  limit  on  the  order  of  the  process,  however j 
one  would  like  tc  have  several  repeated  realisations  of  the  complete 
process  in  the  data.  The  actual  form  of  the  process  can  either 
be  imposed  a  priori  from  theoretical  cons  Ida  be 

estimated  from  the  dot?  directly.  In  most  cases  the  process  would 
need  to  exhibit  •    ast  twe  particular  features:  a  random  shock 
term,  providing      eh      for  inter-indi-v  ual  differences, 
End  a   converge-     tctorj  allowing  for  individt         Lties  to 
assert  therasel         time.  ■      c  mi     Lsm  tsat 

will  do  this  wou  d  be 

Here  c  is  a  ci  <  1)        iting  the  rate  of  con- 

vergence  of  the  individuals  to  the— population  average  or  "norm" 

A  y   whereas  \  .cts  the  random  disturbance  (with  mean  aero) 

ot 

that  maker,  for  individual  diversity,   In  the  e:        Ls  i  en  as 
identical  for  at  least  seme  individuals  in   the  population, 
a  necessary  assumption  if  c  is  to  be  estimated  from  the  da 
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rather  than  specified  a  priori.  In  our  example.  the  diversity  among 
individuals  might  simply  arise  because  of  differing  price  sensi- 
tivity jmong  the  individuals.   he  convergence  towards  some  kind  of 
"norm"  or  average,  on  the  other  hand,  might  be  attributable  to  our 
brand  having  established  a  "norms    .  rLet   share  and  sales  level, 
where  the  role  of  price  1b  a  constant  parameter  across  the  buyers. 
As  one  would  suspect,  estimation  in  Rosenberg's  "convergent 

parameter"  model  becomes  quite  a  bit  more  complicated  than  in  the 

1 
Swamy  approach.   Basically,  the  specification  of  the  stochastic 

parameter  process  (16)  has  to  take  place  first.  Then,  estimates 
of  the  individual  parameters  at  anj  time  period  are  possible  to 
compute,  conditional  upon  the  process  specified.   The  method  used 
is  a  recursive  maximum  likelihood  procedure  which  starts  at  time 
t-1  and  then  "updates"  the  initial  estimates  through  succeeding 
time,  periods  up  through  T.   Since  only  one  rime  period  is  available 
at  t«l,  a  "starting  problem"  exists.   The  solution  is  di.' cussed  by 
Rosenberg  (1973)  at  length  (if  <-  Bayesian  approach  is  used,  these 
initial  values  would,  o£  course.      wen  by  an  informative  or 
diffused  prior)  and  we  xn.Il  no.     Into  it  here.  Then  these 
parameter  values  at  t -■':.  are  ised  in  the  steel      aechanism  (16) 
to  predict  next  period's  parameter  values.  These  new  parameter 
values  are.  then  .or  wi  "ions  on  the 

independent  variabilis  at  t-2  to  generate  a  forecast  of  the  depen- 
dent variable  for  :hat  period.  Finally,  the  difference  between 
this  forecasted  value  &nd   ths  actual  value  of  the  dependent      ble 


The  Rosenberg  model  will  give  the  Swsmy  solution  as  s   special  case 
(when  the  parameter  mean  stays  constant  over  tine,  and  c--:G  in  (16)). 
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at  t-2  is  used  to  revise  the  parameter  escimate.  for  that  period 
(if  this  difference  is  2ero,  no  revision  will  naturally  occur). 

For  more  information  on  tl.  ;  algorithm,  the  reader  is  referred 
to  the  Rosenberg  (1973)  paper.   Presently,  the  computer  program  is 
not  generally  available,  but  it  will  be  available  in  the  near 
future.   It  should  be  noteo  that  in  the  Rosenberg  approach  the 
explicit  values  of  the  different  individuals'  parameters  at  differ- 
ent points  in  time  will  be  available,  and  it  becomes  possible  to 
trace  the  development  of  one  particular  individual  throughout  the 
sample  history.   It  should  perhaps  also  be  noted  that  these  param- 
eters will  generally  be  different  for  different  lengths  of  the 
sample  history.   That  is,  the  revision  of  the  initial  estimate  on 
the  basis  of  the  forecast  error  mentioned,  can  be  carried  out  not 
only  for  the  present  period,  ^ut  also  for  earlier  periods,  back 
to  t«l. 

Extensions 

There  are  many  other  devel  pmenta  within  theP3  main  areas  that 
deserve  coverage  but  cannot  be  encompassed  hera   The  applied  re- 
searcher when  faced  with  statistical  specifications  which  deviate 
from  the  one       id   her.:-  should  be  able  to  use  the  references 
cited  to  go  further  and   formulate  the  appropriate  method.   These 
cases  include  the  problem  of  serial  correlation  in  random  coeffi- 
cient, models  which  is  well  treated  by  Swamy  (1971,  pp.  127-131), 
and  error  components  model  with  lagged  dependent  variables  intro- 
duced on  the  right  hand  Tide,  which  are  discussed  by  Nerlove  (1971). 

In  our  discussion  we  have  consistently  focused  upon  our  brand 
alone..   It  is  clear  that  an  identical  approach  can  serve  for  the 
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ana lysis  of  any  one  of  the  brands  in  the  market.   One  might,  how- 
ever, use.  the  data  across  brands  even  batter  if  the  brands  were  intro- 
duced as  a  dimension  equivalent  to  time  and  individuals.  Then  one 
could  derive  a  "brand*'  effect  in  exactly  the  same  manner  r.s  wa 
earlier  analyzed  "time"  and  "individual"  effects.   It  should  be 
clear  that  where  the  number  of  brands  is  small,  some  algorithms 
will  not  work.   For  the  random  coefficient  esei.mat2.0n,  for 
example,  it  is  essential  that  the  number  of  brands  be  greater 
than  the  number  of  independent  variables  in  the  model.   The  intro- 
duction of  across  brands  variations  in  the  variable  parameter 
model  would  create  computational  difficulties  (since  each  time 
period's  iteration  would  have  to  account  for  both  inter-individual 
and   brand  variation  in  the  parameters)  but  could  probably  bo.  done. 

Overall,  there  is  every  reason  to  hypothesize  that  in  the 
next  few  years  this  area  will  see  a  rapid  growth,  especially  with 
respect  to  the  availability  of  efficient  computing  algorithm;.-. 
Furthermore ,  the  marketers  ought  co  have  much  to  contribute  since 
in  many  respects  their  data  bases  are  the  most:  sophisticated  and 
complete..  After  all,  if  we  spend  all  that  money  collecting  the 
data,  we  ought  also    to  use  the  detailed  information  i-  contains 
to  the  fullest. 
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